E-zyme: predicting potential EC numbers from the chemical transformation pattern of substrate-product pairs
Abstract
MOTIVATION: The IUBMB's Enzyme Nomenclature system, commonly known as the Enzyme Commission (EC) numbers, plays a key role in classifying enzymatic reactions and in linking enzyme genes or proteins to reactions in metabolic pathways. Numerous reactions are known to occur in various pathways but have no official EC number, and most of them are unlikely ever to receive one because of the lack of published articles on enzyme assays.

RESULTS: In this article we propose a new method for predicting potential EC numbers for given reactant pairs (substrates and products) or uncharacterized reactions, together with a web server named E-zyme that implements it. The method is based on our original biochemical transformation pattern, which we call an 'RDM pattern', and consists of three steps: (i) graph alignment of a query reactant pair to compute the query RDM pattern, (ii) multi-layered partial template matching, which compares the query RDM pattern with template patterns associated with known EC numbers, and (iii) a weighted majority voting scheme for selecting appropriate EC numbers. Cross-validation experiments show that the proposed method achieves both high coverage and high prediction accuracy at a practical level, and consistently outperforms the previous method.

AVAILABILITY: The E-zyme system is available at http://www.genome.jp/tools/e-zyme/.
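Steps (ii) and (iii) of the method summarized in the abstract can be illustrated with a minimal sketch. It is not the E-zyme implementation: it assumes that an RDM pattern can be reduced to a hypothetical three-field tuple `(r, d, m)`, that "multi-layered partial matching" means agreement on a leading subset of those fields, and that the layer weights in `LAYER_WEIGHTS` are arbitrary. The function names, the template list, and the labels `EC_A`, `EC_B`, `EC_C` are invented toy values, not real EC assignments.

```python
from collections import defaultdict

# Hypothetical layer weights (an assumption, not taken from the paper):
# a full (r, d, m) match counts most, an r-only match counts least.
LAYER_WEIGHTS = {3: 1.0, 2: 0.5, 1: 0.25}


def match_layer(query, template):
    """Return how many leading fields (r, then d, then m) agree: 0 to 3."""
    depth = 0
    for q, t in zip(query, template):
        if q != t:
            break
        depth += 1
    return depth


def predict_ec(query, templates):
    """Rank EC labels by a weighted vote over partially matching templates."""
    scores = defaultdict(float)
    for pattern, ec in templates:
        depth = match_layer(query, pattern)
        if depth:  # templates that do not even share r cast no vote
            scores[ec] += LAYER_WEIGHTS[depth]
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy template library: (pattern, EC label) pairs with placeholder values.
TEMPLATES = [
    (("r1", "d1", "m1"), "EC_A"),
    (("r1", "d1", "m2"), "EC_B"),
    (("r1", "d2", "m3"), "EC_B"),
    (("r2", "d1", "m1"), "EC_C"),
]

ranking = predict_ec(("r1", "d1", "m1"), TEMPLATES)
print(ranking)
```

In this toy setup one exact match outweighs two partial matches; different weights would change the ranking, which is why the choice of weighting scheme matters for a voting step of this kind.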
Related articles
Generalized reaction patterns for prediction of unknown enzymatic reactions.
Prediction of unknown enzymatic reactions is useful for understanding biological processes such as reactions to external substances like endocrine disrupters. To create an accurate prediction, we need to define a similarity measure in the reaction. We have developed the KEGG RPAIR database which is a collection of chemical structure transformation patterns, called RDM patterns, for substrate-pr...
PathPred: an enzyme-catalyzed metabolic pathway prediction server
The KEGG RPAIR database is a collection of biochemical structure transformation patterns, called RDM patterns, and chemical structure alignments of substrate-product pairs (reactant pairs) in all known enzyme-catalyzed reactions taken from the Enzyme Nomenclature and the KEGG PATHWAY database. Here, we present PathPred (http://www.genome.jp/tools/pathpred/), a web-based server to predict plausi...
Metabolome-scale de novo pathway reconstruction using regioisomer-sensitive graph alignments
MOTIVATION: Recent advances in mass spectrometry and related metabolomics technologies have enabled the rapid and comprehensive analysis of numerous metabolites. However, biosynthetic and biodegradation pathways are only known for a small portion of metabolites, with most metabolic pathways remaining uncharacterized. RESULTS: In this study, we developed a novel method for supervised de novo met...
RPAIR: A Database of Chemical Transformation Patterns in Enzymatic Reactions
Chemical genomics is the next stage of post-genomic analysis. Drugs, environmental substances and various chemical compounds contribute to the fluctuation of bio-systems. Therefore, chemical genomic analysis would require the investigation of relationships between genomes and their extracellular environments. These relationships between bio-systems and environments include complicated biochemic...
ECOH: An Enzyme Commission number predictor using mutual information and a support vector machine
MOTIVATION: The enzyme nomenclature system, commonly known as the enzyme commission (EC) number, plays a key role in classifying and predicting enzymatic reactions. However, numerous reactions have been described in various pathways that do not have an official EC number, and the reactions are not expected to have an EC number assigned because of a lack of articles published on enzyme assays. To...
Journal: Bioinformatics
Volume: 25
Publication date: 2009